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DECODING CIRCUIT, AND DECODER, DECODING METHOD AND 
SEMICONDUCTOR DEVICE THAT USE THE DECODING CIRCUIT 

BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to a decoding circuit, a 
decoder, a decoding method and a semiconductor device, and, 
more specifically, to a decoding circuit that is especially 
effective for the correction of fast optical communication 
field errors, and a decoder, a decoding method and a 
semiconductor device that employ this decoding circuit. 

Brief Description of the Prior Art 

Importance of fast and superior error correction technique 
In consonance with the expansion of the Internet and the 
development of e-business, the rate of increase in the 
volume of data computers can handle and their speed has 
accelerated. Accordingly, there is a demand for increasing 
speed of data transfer among computers, and in line with 
this demand, optical communication that yields transfer 
speed of up to 4 0 Gbps is becoming popular. However, for 
such a fast communication method such as the optical 
communication method, to maintain an acceptable error rate 
at system level requires that the reliability of data 
communication be further increased in proportion to the 
amount of data processed by a computer. 
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Important techniques, called error correction coding 
techniques, have been devised to improve data reliability 
employing high-level mathematics to automatically correct 
errors caused by a variety of factors (e.g., noise along a 
channel) . Popular known techniques are Hamming code and 
Reed-Solomon code, both of which are frequently employed. 
Basically, Hamming codes correct single bit errors, but 
their correction capability is low. For instance, with 
Hamming codes, if a single bit error is detected, the error 
is corrected, but if the detected error covers two bits, 
only the error detection portion of the process is 
performed, no correction is made. However, setting up an 
error correction process for an error correction system 
using Hamming code is simple, and it is well known that by 
performing the error correction process in parallel, a 
processing speed that greatly exceeds 1 Gbps (one billion 
bits per second) can be obtained. 

On the other hand, Reed-Solomon code is a superior error 
correction technique possessing high correction capabilities 
and can be used to correct errors appearing as units 
(symbols) comprising multiple contiguous bits. At present, 
however, because of the complicated calculations that are 
frequently required, using Reed-Solomon code to perform 
correction processes in parallel is difficult; and even when 
pipeline processes using data having eight-bit width are 
performed at 100 MHz , for example, only processing speed of 
around 800 Mb/s can be obtained. Currently, therefore, 
since the speed attainable with conventional techniques 
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using Reed-Solomon codes is not suitable for fields in which 
high processing speed is required, these techniques are 
employed mainly for fields to which comparatively low data 
processing speed is acceptable, such as the low speed 
5 communication field and the data storage-unit field for the 
production of hard disks or secondary storage devices, 
CD-ROMs, for example. 

Error correction technique required by a f ast optical 

10 communication field 

As part of a fast optical communication technique for data 
communication by computers, by the recently popular Internet 
as a backbone, a terabit per second fast communication 
system that employs WDM (Wavelength Division Multiplexing) 

15 and DWDM (Dense WDM), which has improved wavelength division 
multiplexing levels, has been introduced based on the SONET 
technique, according to which frames having a predetermined 
length are synchronously and sequentially transmitted. 

20 As the wavelength division multiplexing levels for the above 
described optical data communication is increased, crosstalk 
occurs between wavelengths that are near each other. To 
cope with this crosstalk, FEC (Forward Error Correction) is 
employed as an error correction method for communication in 

25 long distance (Long Haul) optical wavelength division 
multiplexing. In ITU-T G.975, the ITU (International 
Telecommunication Union) standardized the use of interleaved 
(255, 239) RS code (code length n = 255 bytes) of m = 8 (8 
bits/symbols), and in G.709, the Digital Wrapper standard 
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for defining the FEC frame structure is employed. 

According to the Digital Wrapper standard, for example, 
low-speed serial Reed-Solomon code circuits are arranged in 
5 parallel to achieve a necessary processing capability, and 
for this, interleaving of Reed-Solomon codes is an 
indispensable technique. 

Prior art for fast and high-level error corr ection 
10 techniques 

Independent of the need for optical communication, parallel 
fast decoding using Reed-Solomon code has been studied using 
a combinational circuit. 

15 Fig. 1 is a diagram showing an example of a fast decoder 
that can be used for an error correction apparatus. 
The decoding circuit in Fig. 1 implements a procedure for 
increasing by a multiple of ten or more the decoding speed 
of one decoding circuit, and for performing, at 

20 substantially the comparable speed as that of Humming code, 
parallel decoding in an error correction process using the 
Reed-Solomon coding possessing high-level error correction 
capabilities. For the decoding circuit in Fig. 1, a new 
representation using the elementary symmetric functions is 

25 employed for decoding Reed-Solomon codes, so that an error 
value polynomial Er(x) of 0(t) -degree with which error 
values can be directly calculated is employed (t is the 
maximum number of correctable errors) . 
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Since the decoding circuit in Fig. 1 employs this 
polynomial, not only syndrome calculation and error location 
evaluation, but also error value evaluation can be directly 
obtained by performing calculations for a single polynomial. 

5 Therefore, compared with the conventional methods whereby 
these calculations are performed by Forney algorithm to 
divide evaluation results obtained for two polynomials, a 
greatly simplified operation can be used. Further, in the 
decoding circuit in Fig. 1, a representation appropriate for 

10 a combinational circuit is employed not only for the 
calculation of the coefficients of Er(x), but also for the 
calculation of the coefficients of the error locator 
polynomial A(x), so that not only can a higher processing 
speed be provided, but in addition, the number of required 

15 arithmetic circuits can be reduced. 

When the decoding circuit in Fig. 1 is employed, a random 
4-byte error correction circuit, which is mounted on an 
experimental base for a semiconductor using the standard 

20 0.35 um ASIC technique, can process in parallel, and at a 
low latency (45 ns), data having a 320-bit width, and a 
processing speed of 7 Gb/s (7 billion bits per second) can 
be obtained that is nearly ten times higher than the typical 
processing speed of 800 Mb/s available with a current serial 

25 decoding circuit. In addition, when a new circuit 

optimization algorithm specifically prepared for a large 
parallel error correction circuit and a circuit sharing 
method are employed for the decoding circuit in Fig. 1, the 
circuit size can be reduced. And furthermore, since the 
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decoding circuit in Fig. 1 is a combinational circuit that 
does not require an external controller and registers, in 
spite of the high processing speed that can be attained, 
power consumption can be reduced. 

5 

However, the decoding circuit in Fig. 1 can not provide a 
processing speed that equals 40 Gbps required for optical 
communication, and in order to cope with the 8-byte error 
correction standard established by the ITU, when the normal 
10 circuit sharing method is used the resulting circuit can be 
so large that it can not be mounted on a single chip. 

Fig. 2 is a schematic diagram showing the configuration of 
M an error correction circuit that employs a conventional 

**! 15 low-speed decoding method for optical communication. With 
Q this configuration, as the communication speed of an optical 
communication field increases, the conventional method 
whereby low-speed serial Reed-Solomon decoders are arranged 
in parallel becomes ever more inappropriate. Through 
Q 20 conventional RS decoders have a processing speed below 1 
Gbps, the decoding method in Fig. 2 achieves the necessary 
processing speed by an appropriate arrangement of low-speed 
serial Reed-Solomon decoders. However, according to the 
conventional method in Fig. 2, for such an arrangement of 
25 many Reed-Solomon decoders are required, and accordingly, 
the circuit size is increased in direct proportion to the 
data transfer speed used for optical communication. Fig. 3 
is a graph showing circuit size and data transfer speed 
plotted when the decoding method in Fig. 2 is employed. 



ri 

w 



6 



JP920010049US1 



M s 

TfO? 

p 



8 



Fig. 4 is a diagram showing another conventional decoding 
circuit (A. Patel, IBM J. Res. Develop., vol. 30, pp. 
259-269, 1986) . Since according to this conventional 

5 decoding method, the processing speed can be easily 
increased for the calculation of syndromes and error 
locations. However, since as is shown in Fig. 4 Forney 
algorithm is employed for the calculation of error value, 
two polynomials, i.e., the differential dA(x)/dx for the 

10 error locator polynomial and the error evaluator polynomial 
Q(x), which are obtained by the syndromes and the error 
locator polynomial, must be evaluated, and then divisions 
must be performed. This is a critical path that prevents an 
increase in output speed, and the processing speed can not 

15 be satisfactorily increased. 



According to OC-768 SONET, this is a large problem, because 
assuming the 16 interleave defined by ITU-G709 is employed 
as an input/output interface for the decoding circuit, a 
fit 20 fast processing speed of 300 MHz or higher is expected. 

Therefore, as one attempt, the decoder in Fig. 4 is employed 
and divisions corresponding to the critical path are 
converted into detailed pipelines to increase output speed. 

25 However, even when the process is converted into a pipeline, 
the decoding circuit in Fig. 4 must perform divisions at 
locations whereat no error is present, and the circuit size 
and the power consumption are increased as the pipeline is 
constructed. Further, to perform divisions only for error 
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locations, the locations must be calculated in advance, so 
that the error locations and the error values can not be 
calculated in parallel. In addition, for the decoding 
circuit in Fig. 4, a cycle count required for the output of 
the error values differs depending on whether an error is 
present. Therefore, when a synchronous frame, such as 
SONET, for sequential data must be input or output at high 
speed, it is difficult to output error values at high speed 
for a constant cycle, without depending on error patterns 
(number of errors and their locations) . 

Fig. 5 is a diagram showing an additional conventional 
decoder. When the parallel Reed-Solomon decoding method in 
Fig. 1 is employed for the optical communication field, 
because its circuit processing capability is superior to 
those of other conventional methods no problem occurs when 
non-interleaved RS code is used for an application. 
However, for interleaved Reed-Solomon codes, as defined by 
ITU-T G.975, since signals must be rearranged using a large, 
high-speed buffer and selector, the parallel Reed-Solomon 
decoding method is not always efficient. That is, the 
length of (255, 239) RS code is 2040 bits, and when a 16-byte 
interleaving process is performed, a 16-byte input and 
255-byte output serial/parallel converter and a 
parallel/serial converter for a 255-byte input and a 16-byte 
output are required, thereby considerably increasing the 
size of a circuit even though the processing speed can be 
increased to a required level. Therefore, it is difficult 
for the parallel Reed-Solomon decoding method to be provided 
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at a practical level for optical communication. 

For the calculation of error locations and error values used 
for the decoder, a large number of calculations in the 
5 Galois extension field GF(2 m ) must be performed at high 
speed, and further, the size of a circuit that can perform 
this processing must be such that it facilitates the 
implementation of the circuit. Conventionally, in the 
studies of the calculations over a Galois field, it is 
10 important that how efficiently a single calculation 
(multiplication or division) can be performed, and the 
b several tens to hundreds of calculations by a combinational 
*t circuit have almost never been discussed to date. As one of 

M various reasons this has not been done, it may be presumed 

ffi 

% 15 many decoding operations tend to be performed by sequential 
W circuits, and it has been ascertained that the use of a 

U combinational circuit provides little merit in terms of 

W processing capabilities and an acceptable circuit size. 

Ill 

® 20 Durinq the studies of the error correction calculation 

fit 

algorithm, the Yule-Walker equation that is defined for the 
Galois extension field GF(2 m ) is generated in decoding of the 
Reed-Solomon codes. The efficient processing of this 
Yule-Walker equation is desirable if high-speed processing 
25 is to be achieved and the size of the necessary circuit is 
to be minimized. When the algorithm for solving the 
Yule-Walker equation is performed by a combinational circuit 
to achieve high-speed processing, in as the required error 
correction capabilities increase, the portion of the circuit 
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used to solve the Yule-Walker equation and to locate errors 
becomes very significant from the viewpoint of the reduction 
in the size of the combinational circuit. 

5 In addition, when a combinational circuit that can carry out 
the decoding of Reed-Solomon codes is applied for an actual 
system, it is preferable that an algorithm be provided that 
can be applied for the decoding of Reed-Solomon codes having 
an arbitrary minimum distance in order to obtain a process 

10 that can be widely used and to remove superfluous, 
additional circuits or processes. Especially in the optical 
communication field, since the use of (255, 239) 
Reed-Solomon code is standardized by the ITU, an algorithm 
is required that can efficiently decode the Reed-Solomon 

15 code where the maximum number of correctable errors is 8 and 
the minimum distance is 17. 

In order to solve the mathematical problem posed by the 
Yule-Walker equation by using a hardware combinational 

20 circuit having a size that permits it to be implemented, 
increase in the circuit size must be suppressed, and an 
algorithm that can reduce the number of multipliers and a 
combinational circuit that can efficiently employ this 
algorithm are required. That is, a combinational circuit is 

25 needed that has an implementable size and that performs 
high-speed processing, and that includes the error 
correction device and the error correction algorithm 
described above. 
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SUMMARY OF THE INVENTION 

To resolve the above shortcomings, it is one object of the 
present invention to provide a decoding circuit, a decoder, 

5 a decoding method and a semiconductor device for efficiently 
processing interleaved Reed-Solomon codes in order to 
perform fast optical communication (40 Gbps or higher), or 
more specifically, to perform SONET communication using 
wavelength division multiplexing, in which sequential data 

10 are transferred as synchronous frames. 

That is, according to the invention, provided is a decoding 
circuit which has a high processing capability (low latency 
and high throughput) . 

15 

It is another object of the present invention to provide a 
flexible decoding circuit that can process interleaved 
Reed-Solomon codes without the above described 
characteristics being lost. 

20 

It is an additional object of the present invention to 
provide a decoding circuit for sequentially outputting error 
words at high speed and at a constant cycle rate, regardless 
of error patterns (error values and error locations) of the 
25 individual interleaved words that are received. 

It is a further object of the present invention to provide a 
decoder that includes the above described decoding circuit 
and a combinational arithmetic circuit, circuits over a 
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Galois extension field GF(2 m ) (m is an arbitrary natural 
number equal to or greater than 2), that has several common 
inputs and performs multiple multiplications (e.g., AB, AC 
and AD) and a circuit that performs logical sums such as AB 
5 + CD + EF + . • can be implemented as small, fast, 
efficient processing circuits, and to provide a decoding 
method and a semiconductor device employing this decoding 
circuit . 

10 To perform the decoding of Reed-Solomon codes by a 
combinational circuit and to apply this combinational 
circuit to an actual system, it is a still further object of 
the present invention to provide a decoding circuit that can 
perform a flexible process that can be applied for the 

15 decoding of Reed-Solomon codes having an arbitrary minimum 
distance and interleave configurations without an additional 
circuit or process being required, and to provide a decoder, 
a decoding method and a semiconductor device using this 
decoding circuit. 

20 

The above objects can, according to the present invention, 
be achieved by providing a decoding circuit, a decoder, a 
decoding method and a semiconductor device. 

25 According to the present invention, a decoding circuit 
comprises : 

an input unit for entering coded digital signals in 
parallel in accordance with the number of interleaved codes; 
a processor, including an error locator polynomial 
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calculator and an error value polynomial calculator, for 
processing data obtained serially from the interleaved codes 
that are received by the input unit; and 

an output unit for correcting errors by employing the 

5 output data that are received from the processor and the 
digital signals, and for outputting the obtained digital 
signals in parallel in accordance with the number of 
interleaved codes. In the decoding circuit of the 

invention, the input unit calculates syndromes for the input 

10 digital signals, and transmits the syndromes to the 
processor as data obtained serially from interleaved codes; 
the processor employs the syndromes to calculate 
coefficients of error locator polynomial and coefficients of 
error value polynomial; and the output unit, based on 

15 coefficients of the error locator polynomial and 
coefficients of the error value polynomial received from the 
processor, generates error locations and error values, using 
a linear calculation in a Galois extension field for the 
input digital signals, and defines the error locations and 

20 the error values as the digital signals to be output. In 
the decoding circuit of the invention, the input digital 
signals are Reed-Solomon codes that are received in parallel 
through an i-channel, and the decoding circuit includes a 
ratio i:l or l:i multiplexer or demultiplexer. For optical 

25 communication in the decoding circuit of the invention, 
wavelength division multiplexing is used for the 
transmission of the input digital signals. In the decoding 
circuit of the invention, at the least, the input unit is 
constituted by a sequential circuit and the processor is 
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constituted by a combinational circuit. According to the 
present invention, the decoding circuit can be used for the 
correction of digital signal errors or for encryption. 

5 According to the present invention, a decoder comprises: 
input means, for receiving coded digital signals; 
processing means, for processing the coded digital 
signals and for calculating coefficients of error locator 
polynomial and coefficients of error value polynomial; and 
10 output means, for outputting digital signals for which 

n errors have been corrected using coefficients of the error 
*? locator polynomial and coefficients of the error value 

i rs 

m polynomial, 

W wherein the input means receives in parallel, in 

p 15 accordance with the number of interleaved codes, the coded 

* n digital signals, and employs the coded digital signals to 

|5 calculate syndromes as data obtained serially from 

n 

interleaved codes, 
O wherein the processing means employs the syndromes 

20 output by the processing means to calculate coefficients of 
the error locator polynomial and coefficients of the error 
value polynomial, and 

wherein the output means employs coefficients of the 
error locator polynomial, coefficients of the error value 

25 polynomial and the coded input digital signals to correct 
errors using a linear calculation in a Galois extension 
field, and outputs in parallel the obtained digital signals 
in accordance with the number of interleaved codes. In the 
decoder of the invention, the input digital signals are 
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Reed-Solomon codes that are received in parallel through an 
i-channel, and the decoder includes a ratio i:l or l:i 
multiplexer or demultiplexer. For optical communication in 
the decoder of the invention, wavelength division 

5 multiplexing is used for the transmission of the input 
digital signals. In the decoder of the invention, at the 
least, the input unit is constituted by a sequential circuit 
and the processor is constituted by a combinational circuit. 
According to the present invention, the decoder can be used 

10 for the correction of digital signal errors or for 
encryption. 

According to the present invention, a method for decoding a 

digital signal comprises: 
15 an input step of entering coded digital signals in 

parallel in accordance with the number of interleaved codes; 
a process step of employing a processor, including an 

error locator polynomial calculator and an error value 

polynomial calculator, to process data obtained serially 
20 from the interleaved codes that are received at the input 

unit ; 

a generation step of employing the output data that are 
received from the processor and the digital signals to 
generate digital signals for which an error has been 
25 corrected; and 

an output step of outputting the obtained digital 
signals in parallel in accordance with the number of 
interleaved codes. In the decoding method of the invention, 
the input step includes the step of: 
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calculating syndromes for the input digital signals, 
and transmitting the syndromes to the processor as data 
obtained serially from interleaved codes, 

wherein the process step includes the step of: 
5 employing the syndromes to calculate coefficients of 

error locator polynomial and coefficients of error value 
polynomial, and 

wherein the output step includes the step of: 
based on coefficients of the error locator polynomial 
10 and coefficients of the error value polynomial received from 
the processor, generating error locations and error values, 
C3 using a linear calculation in a Galois extension field for 

S the input digital signals, and defining the error locations 
N* and the error values as the digital signals to be output. 

fti 

3 15 The decoding method of the invention further comprises: 

(3 a i:l multiplexing step and a l:i demultiplexing step, 

f'| wherein the input step includes the steps of: 

W receiving the input digital signals that are 

m Reed-Solomon codes, in parallel through an i-channel. For 

S 20 optical communication by the decoding method of the 

F t?- 

invention, wavelength division multiplexing is used for the 
transmission of the input digital signals. In the decoding 
method of the invention, at the least, the input step is 
used for the calculation using a sequential circuit and the 
25 process step is used for the calculation using a 
combinational circuit. According to the present invention, 
the decoding method can be used for the correction of 
digital signal errors or for encryption. 
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According to the present invention, a semiconductor device 
used to process a digital signal comprises: 

input means, for receiving coded digital signals; 

processing means, for processing the coded digital 
5 signals and for calculating coefficients of error locator 
polynomial and coefficients of error value polynomial; and 

output means, for outputting digital signals for which 
errors have been corrected using coefficients of the error 
locator polynomial and coefficients of the error value 
10 polynomial, 

wherein the input means receives in parallel, in 
accordance with the number of interleaved codes, the coded 
digital signals, and employs the coded digital signals to 
calculate syndromes as data obtained serially from 
15 interleaved codes, 

wherein the processing means employs the syndromes 
output by the input means to calculate coefficients of the 
error locator polynomial and coefficients of the error value 
polynomial, and 

20 wherein the output means employs the error location, 

the error value and the coded input digital signals to 
perform an operation in a Galois extension field, and 
outputs in parallel the obtained digital signals in 
accordance with the number of interleaved codes. In the 

25 invention, the input digital signals are Reed-Solomon codes 
that are received in parallel through an i-channel, and the 
semiconductor device includes a ratio i:l or l:i multiplexer 
or demultiplexer. For optical communication in the 

invention, wavelength division multiplexing is used for the 
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transmission of the input digital signals. In the invention 
the input means is constituted by a sequential circuit and 
the processing means is constituted by a combinational 
circuit. In this invention, the combinational circuit 
comprises : 

a plurality of multipliers, independently performing 
two or more multiplications for coded digital signals in a 
Galois extension field GF (2 m ) (m is an integer equal to or 
greater than 2 ) , 

wherein the multipliers include 

an input side XOR calculator, 

an AND calculator, and 

an output side XOR calculator, and 

wherein the multipliers share the input side XOR 
calculator. Further, in this invention, the combinational 
circuit comprises: 

a logical sum calculator for a Galois extension field 
GF(2 m ) (m is an integer equal to or greater than 2), and 

the multipliers each include an adder connected between 
the AND calculator and the output side XOR calculator, 

wherein the output side XOR calculator is used in 
common, and 

wherein the outputs of the AND calculators in the 
multipliers are added by the adders, and the addition 
results are calculated by the output side XOR calculator 
that is used in common. In the semiconductor device of the 
invention, the input of the multipliers is commonly used, 
and the input side XOR calculator is used in common by the 
multipliers . 
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The semiconductor device of this invention is used for an 
error location calculator, for calculating an error location 
for a digital signal transmitted using wavelength division 
5 multiplexing, and for an error value calculator. The 
semiconductor device of this invention is used for decoding, 
error correction or encryption. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
10 Fig. 1 is a diagram showing a conventional decoder. 

Fig. 2 is a diagram showing a conventional error correction 
device for optical communication. 

Fig. 3 is a graph obtained by plotting a conventional 
circuit size and a data transfer speed. 
15 Fig. 4 is a diagram showing another conventional decoding 



M circuit. 



HI Fig. 5 is a diagram showing an additional conventional 

decoder. 

y 

Cfl Fig. 6 is a schematic diagram showing a decoder according to 

20 one embodiment of the present invention. 

Fig. 7 is a diagram showing a multiplier having a 
conventional configuration . 

Fig. 8 is a diagram showing a multiplier having another 
conventional configuration . 
25 Fig. 9 is a diagram showing a multiplier having an 
additional conventional configuration . 

Fig. 10 is a diagram showing the embodiment wherein the 
present invention is applied for the multiplier in Fig. 7. 
Fig. 11 is a diagram showing the embodiment wherein the 
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present invention is applied for the multiplier in Fig. 8. 
Fig. 12 is a diagram showing the embodiment wherein the 
present invention is applied for the multiplier in Fig. 9. 
Fig. 13 is a graph obtained by plotting the circuit size and 
5 the number of multiplier when the multiplier according to 
the invention is employed. 

Fig. 14 is a diagram showing a conventional error locator 
polynomial . 

Fig. 15 is a diagram of a formula, according to the 
10 invention, established for the Yule-Walker eguation. 

Fig. 16 is a diagram showing the detailed structure of r ( / +1) 
according to the invention. 

Fig. 17 is a diagram showing the detailed calculation 
results obtained, according to the invention, for decoding 
15 Reed-Solomon code. 

Fig. 18 is a schematic flowchart for an error correction 
algorithm according to the present invention. 

Fig. 19 is a schematic diagram showing the configuration of 
a Reed-Solomon code decoder according to the invention. 
20 Fig. 20 is a schematic block diagram showing the 
configuration of an error correction device according to the 
invention. 



25 DETAILED DESCRIPTION OF THE INVENTION 

The preferred embodiment of the present invention will now 
be described by referring to the accompanying drawings. 
Note, however, that the present invention is not limited to 
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this embodiment. 



Section 1 Decoding circuit 

Fig. 6 is a diagram showing a decoder according to the 
5 present invention that can be used to correct errors in 
digital signals received through optical communication. The 
decoder in Fig. 6 includes an input unit 10, a processor 12 
and an output unit 14. The input unit 10 receives a 16-byte 
interleaved digital signal ID. The processor 12 processes a 
10 signal received from the input unit 10 and calculates 
M coefficients of an error locator polynomial and coefficients 

of an error value polynomial. And the output unit 14 
obtains an AND of the A (x , evaluation result and E r(x) 
evaluation result that are generated from the data received 
15 from the processor 12, performs an XOR process with the AND 
* result and the input digital signal ID, and generates a 

digital signal OD. The digital signal OD is a signal in 
C3 which errors that may have been included in the digital 



signal have been corrected. 



ru 20 



In this invention, the digital signal ID input to the 
decoder in Fig. 6 can be one for which optical communication 
was used for its transmission, especially a signal 
transmitted by wavelength division multiplexing at a high 
25 data transfer rate of 40 Gbps. More specifically, the input 
digital signal ID can be, for example, a signal that is 
transmitted as (255, 239) RS code having a length of 2040 
bits. Generally, for wavelength division multiplexing, by 
using the interleave method, the input digital signal is 
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transmitted, for example, as a 16-byte parallel stream of 
255 bytes to the decoding circuit of the invention. 

In the decoder design in the invention in Fig. 6, input 
5 digital signals ID are interleaved and are input in 
parallel. A reception polynomial is defined for each input 
signal, and a syndrome Si is calculated using this 
polynomial and is output by the input unit 10. For (255, 
239) RS code, the syndrome Si output by the input unit 10 
10 (syndrome calculator) is generated as 16-byte digital 

|* information obtained from the 255 byte input digital signal. 

y In the decoder for this embodiment in Fig. 6, the input 

digital signal of 2040 bits is interleaved, and the 16 
255-byte serial streams that are obtained are transmitted to 

„p. 15 the input unit 10, while sixteen 16-byte syndromes 
corresponding to the 16 serial streams are generated by the 

s 

£5 input unit 10. 

W 

Cm As is shown in Fig. 6, in the input unit 10, one syndrome 

13 

n'i 20 calculator 16 is allocated for a serial stream IDS for each 
input digital signal ID, and calculates syndromes. The 
obtained syndromes for each serial stream IDS are stored in 
a register 18, and are output to the processor 12 one by 
one. For example, as is described above, 16 of 128 bit 
25 signals are obtained using the 16 interleaved 255 byte input 
digital signals in order to calculate error locations and 
error values. The syndrome calculator 16 that can be used 
for the input unit 10 in Fig. 6 can be any well known 
circuit, such as a circuit employing a sequential circuit. 
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The method used for defining the syndromes and the 
calculation of the syndromes will be described later in 
detail . 

5 In the decoder in Fig. 6, the obtained syndromes Si are 
sequentially transmitted to the processor 12. For 
convenience sake, for the explanation only one processor 12 
is shown in Fig. 6. The processor 12 includes: an error 
locator polynomial calculator 18, which is a combinational 

10 circuit constituted by multiple multipliers and which is 
used to calculate an error locator polynomial A(x); and an 
error value polynomial calculator 20, which is used to 
calculate an error value polynomial Er(x). The 
combinational circuit that is used in the processor 12 and 

15 that includes multipliers will be described in more detail 
in Section 2 Combinational circuit>. 

The error locator polynomial A(x) and the error value 
polynomial Er(x), which are output by the processor 12 in 

20 Fig. 6, are demultiplexed by a demultiplexer (not shown) so 
as to, for example, correspond to the number of interleaves, 
and the results are transmitted to the output unit 14. The 
output unit 14 includes registers 22, and AND gates 24 and 
XOR gates 26, which are arranged in a number equivalent to 

25 the interleaves of input digital signals. The output unit 
14 selects error values E r from the AND gates 24 by using the 
error location data A eV ai ("1" for an error location or "0" 
otherwise) that are obtained by the syndromes. The selected 
error values E r are added by the XOR gates 26, and each XOR 
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gate receives, via buffers 28a and 28b of imn bits, a serial 
stream IDS obtained from the coded digital signal ID. When 
the subtraction of the Galois extension field GF(2 m ) is 
performed by the XOR gate 26, the 255 byte digital signal OD 
5 from which an error has been removed is output. 

In the decoder in Fig. 6 for this invention, the 
combinational circuit constituting the processor 12 can be a 
sequential circuit. However, in this invention, especially, 

10 a plurality of multipliers may be constituted by three 
stages, an input side XOR calculator group (variable 
pre-processors) , an AND calculator group and an output side 
XOR calculator group (residual calculators) . When either 
one or both of the variable pre-processor and the residual 

15 calculator are used in common by multipliers, the size of 
the processor 12, which is the critical portion of the 
conventional calculation of error locations and error 
values, is practically allowable, and the processor 12 can 
be efficiently constituted by multipliers. 

20 

In the decoder of the invention in Fig. 6, the output unit 
14 is constituted only by circuits, such as a constant 
multiplier and an adder, performing linear calculations, 
without a circuit being used to perform non-linear 
25 calculations that reduce the processing speed. Therefore, 
for this invention a decoding circuit having a small circuit 
configuration can be provided that performs a process 
without deteriorating the processing speed. Further, 
through the careful study to provide this invention, the 
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present inventors found that a decoding circuit that is more 
flexible, faster and smaller than a conventional circuit 
could be constituted by using a combinational circuit that 
includes multipliers having a specific configuration, and an 
algorithm that permits the combinational circuit to 
efficiently calculate error locations and number of errors. 

The method or the algorithm of the invention for the error 
locator polynomial calculation will, along with the 
combinational circuit, be described in detail later. An 
explanation will now be given for the function and the 
operation of the error value polynomial calculator 20 
included in the processor 12 of the decoding circuit of the 
invention. 

Selection of an error value calculation algorithm 
According to the invention, an algorithm that can directly 
evaluate not only error locations but also error values 
through the calculation of the 0(t) -degree polynomial 
(linear calculation) is applied for the decoding of 
interleaved Reed-Solomon codes. At this time, in the 
algorithm used for this invention, the division necessary 
for an error value calculation is not performed for each 
error location on a critical path that is output following 
the evaluation of a polynomial, but by using only one 
calculation for each code word before the evaluation of a 
polynomial. Therefore, the values obtained by the 

polynomial evaluation can be directly output as error values 
at high speed at a constant cycle rate. 
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Further, it has been found that the degree of the error 
value polynomial Er(x) can be reduced to the t-1 degree that 
is the least required to obtain t independent outputs. At 

5 this time, t coefficients can be calculated by using the 
coefficients of the error locator polynomial and the 
syndromes. By using this algorithm, not only the syndrome 
calculation and the error location evaluation, but also the 
error value evaluation can be performed merely by a linear 

10 operating circuit. Thus, the entire input/output circuit 
can be simplified and the processing speed can be increased. 

Various error value polynomials can be used for the decoding 
algorithm or the decoding method of the invention. In the 
15 explanation of the function and the operation of the 
invention, the following example is employed wherein when e 
errors have occurred at i Q , . . . and i e -i the error value 
polynomial Er(x) is provided as 

j<k 

20 and where, while the division is included, the denominator 
is not a polynomial but is a constant for each code word (in 
the above equation, a denotes the primitive element of the 
Galois extension field) . 

25 The error value polynomial Er !e) (x) can not be used as it is 
for the decoding circuit in this invention, because the k-th 
error value E ik (hereinafter referred to simply as E lk ) , 
which is present at location a 1 " (hereinafter referred to 
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simply as a 1 * ) , is required for the calculation of the 
polynomial Er (e) (x) , and this makes no sense for the purpose 
of this invention. However, if all error values E ik in 
Er (e) (x) and locations a 1 " are written by the syndromes Si, 
5 this process can be provided as a circuit. Further, when 
all a 1 * s in Er (e) (x) are described by the error polynomial Af 
the error value can be calculated before the error 
locations are acquired. Therefore, the calculation of error 
values can be performed in parallel, and as a result, the 
10 processinq speed can be increased. 

The above process can be performed by usinq the followinq 
decodinq alqorithm or decodinq method. First, the 

denominator is written by Af . The coefficients of the 
15 error value and error locator polynomials are elementary 
U symmetric functions for error locations 

C3 a u f a 11 , a i2 , . . . , a u ~ l 

S excluding a constant factor. For example, for coefficients 

S of the error locator polynomial, Ai, A 2 , - . -/ A t , the 

Q 

III 20 following equations are established: 

k 

k<l 

The error locations a l \ a il , a 1 * , . . . , a 1 ^ can be exchanqed with 
each other. Further, the followinq new relation 
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f<«) = n(a i ' + a lj ) 



Af A ( 3 e) - 0 
1 A< e) A? - 
0 A ( x e) A ( 3 e) - 




0 
0 



0 
0 




can be obtained by a relation (e.g., see "Symmetric 
Functions and Orthogonal Polynomials", I . G. Macdonald, 
American Mathematical Society, 1998) that is established 
between the Elementary symmetric function and the Shur 
function defined by the division of two Vandermonde 
matrixes, and the exchange relation, (a + b) 2 = (a - b) 2 = a 2 
+ b 2 , that is established between the addition and the square 
performed for the Galois extension field GF(2 m ) . 

When the above determinant is employed, the denominator of 
Er (e > (x) can be written by Af . When the numerator of Er (e) (x) 
is calculated in the same manner, as is indicated by the 
following equation, the coefficients of Er (e > (x) can be 
written merely by Si and A< e) , and can be calculated without 
using E ik and a ik . 
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Er^(x) 

ZE^Uix+a 11 ) n (a^+a 1 *) 

k=0 j*k m, l*k,ni>l 

_ __. JJ/ a i„ + a i^ 

= 1 f(e)(Af,A ( 2 e) ,...,A ( e e) ) 

1 E ik a^ErfM\ A?>, ^) 

k f j r m=0 

f(e)(Af,A ( 2 e) , ...,A ( e e) ) 

2 S m x j Er^liA^ , A ( 2 e) , . . . , A ( e e) ) 

j r m=0 

f(e)(A ( x e) ,A ( 2 e) , ...,A ( e 6) ) 

|j where A%=Af +a^Af Uk . Specifically, this coefficient, is 
%U that of the error locator polynomial that corresponds to the 
m error location except a ik . 

*t* 5 

" f As is described above, when these new relations are 
?? employed, the output unit 14 can be constituted by a linear 
Q operating circuit for the Galois extension field GF (2 m ) , 

^ without a non-linear operating circuit being required, and 

C3 

ry 10 the processing speed can be increased. 

The following configuration is employed to constitute a 
fast, small coding circuit that can also cope with 
interleaved codes based on the above error value calculation 
15 algorithm. 

m Use of fast input unit and output unit (linear 
operating circuit) 

First, in consideration of the structure of the code (the 
number of interleaves), the bus width of an input /output 
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interface and the number of clocks, a polynomial evaluation 
(linear operating) circuit, which is connected to the input 
side to calculate syndromes, and a polynomial evaluation 
(linear operating) circuit, which is connected to the output 
5 side to evaluate error locations and error values, are 
implemented as fast sequential circuits that employ a cyclic 
structure that comes from the fact that RS codes are cyclic 
codes, and that perform a pre-process and a post-process for 
(n, k) Reed-Solomon code at an arbitrary number of clocks 
10 from 1 to n. Especially effective for the invention is the 
fact that the above described structure is also employed for 

n 

J5J an error value evaluation portion that is a critical path 

fear 

tO for the conventional method. According to the structure of 

M 

m the invention, a decoding circuit can be provided as an 

*Ff 15 interface that can flexibly cope not only with an input 
g digital signal that has 255 bytes for each code word, but 

also with ones that have 1, 3, 5, 15, 17, 51 and 85 bytes. 

W 

o 

jjj Table 1 shows the relation between the width of a digital 

ry 20 signal input to the decoding circuit and the processing 
clock required for this input width. 

Table 1: The input/output width and a clock count for each 
code word (n = 255) 



25 



Input /output 
width (bytes) 


1 


3 


5 


15 


17 


51 


85 


255 


Clock count 
required for 
processing a code 
word (n = 255) 


255 


85 


51 


17 


15 


5 


3 


1 



As is shown in Table 1, although the required number of 
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clocks is increased as the width of the input digital signal 
is reduced, the decoder of the invention can flexibly cope 
with it. 

In addition, in this invention, an arbitrary input/output 
byte width other than those in Table 1 can be selected for 
each code word. For example, an input/output width of eight 
bytes can be coped with when the code length is n = 256 
bytes by adding a one byte dummy at the end. 

(2) Connection to a non-linear operating circuit 12 
In this invention, a plurality of input units 10 and output 
units 14 that are constituted by sequential circuits are 
prepared in a number equivalent to the interleaves. Between 
these units, the non-linear operating circuit 12 is 
connected that calculates the coefficients of an error 
locator and error value polynomials by a multiplexer, a 
demultiplexer and a circuit for holding the syndromes and 
the coefficients of an error locator and error value 
polynomials. As is described above, this operating circuit 
12 is constituted as a combinational circuit for a 
non-linear operating circuit, such as a multiplier that 
performs non-linear calculation. Therefore, in this 

invention, in the case where ITU-T G.709 (255, 239) 
Reed-Solomon code is obtained by 16-byte interleaving, 16 
sequential circuits must be prepared before and after the 
operating circuit 12. In other words, in this case, 
multiplexing and demultiplexing must be performed at a ratio 
of 16:1. However, in this invention, in order to multiplex 
and demultiplex the coefficients of the syndrome polynomial, 
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only a signal of 128 bits (or 136 bits at the succeeding 
state, depending on how A(x) is defined) need be processed 
instead of 2040 bits, so that the required numbers of 
multiplexers, demultiplexers and buffers can be greatly 
5 reduced. 



(3) Three-stage pipeline operation system that employs the 
non-linear operating circuit in the center in a time sharing 
manner 

10 According to the present invention, the entire decoder is 
operated as a three-stage pipeline formed from a linear 
operating circuit (syndrome calculation) , a non-linear 
operating circuit (calculation of coefficients for error 
locator and error value polynomials) and a linear operating 
15 circuit (evaluation of error locations and error values), 
and the non-linear operating circuit of a low latency is 
employed in a time sharing manner by serially providing 
syndromes for code words for the calculation of interleaved 
code. Therefore, it is possible to provide efficient 
20 decoding of interleaved codes whose processing capability 
per circuit size is high. In the OC-768 case, for example, 
in order to operate the three-stage pipeline, the non-linear 
operating circuit in the center must complete the process 
for each interleaved code word at a latency of about 40 ns. 
25 When the combinational circuit is implemented by the most 
advanced semiconductor technique (0.18 jam or better), the 
above decoder can be provided as an ASIC semiconductor 
device . 
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More specifically, in the invention, fast decoder and error 
correction devices, the sizes of which fall within an 
acceptable range, can be provided by synergistic effects 
obtained by: (1) the use of a fast input unit 10 and a fast 

5 output unit 14, especially the use of only a linear 
operating circuit when calculating error locations and error 
values, the application of the above described algorithm for 
the processor 12, and the use of a configuration for a 
multiplier that can reduce the circuit size, and (2) the 

10 employment of the three-stage pipeline operation system by, 
in a time sharing manner, the processor 12, which 
essentially performs a non-linear calculation. In section 
2, which follows, an explanation will be given for a 
combinational circuit that comprises multipliers included in 

15 the processor 12 of the decoder of the invention. 

Section 2 Combinati onal circuit 

The processor 12 used for the decoder of the invention is a 
nonlinear circuit, specifically, a combinational circuit 

20 that uses multipliers. Unlike multipliers used for a 
conventional combinational circuit that has two stages, 
consisting of an AND calculator group and an XOR calculator 
group, in order to perform the multiplication of the Galois 
extension field GF (2 m ) , the multiplier used for this 

25 invention has three stages, consisting of XOR gates, AND 
gates and XOR gates. 

Configuration of a single parallel-multipl ier 

While many studies have been made of a single multiplier, -a 
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parallel multiplier (Mastrovito Multiplier) , which is 
constituted as a combinational circuit, not as a sequential 
circuit, is a field of active research. For a conventional 
parallel multiplication circuit (hereinafter, in this 
5 specif ication, referred to simply as a multiplier) , there 
are two configuration types: the AND-XOR type and the 
XOR-AND-XOR type, which can be converted into each other. 
It should be noted, however, that the AND-XOR type is 
generally employed when a circuit is provided for only a 
10 single multiplication. This is because while the AND-XOR 
m type has been well studied and various methods for obtaining 

;~: a small circuit have been proposed, there is no guarantee 

that the circuit size for the XOR-AND-XOR type will be 
* n reduced (or may be increased) , and the reduction effects can 

15 not be obtained that would compensate for the expenditure of 
the effort a complicated design operation would entail. The 
AND-XOR type and the XOR-AND-XOR type will be further 
«i described. 
Cfi 



it* 



20 (1) AND-XOR type 

This is a typical method used for performing calculations in 
the same manner as are calculations performed with figures 
written down on paper, and generally a circuit of this type 
is employed. Specifically, the coefficients of two (m-l)th 

25 degree polynomials, which are multiplication arguments, are 
combined to prepare m 2 partial products. This is the 
processing performed by the AND section. Then, partial 
products that have the same degree are added together to 
form a (2m-2)th degree polynomial, and a residue operation 
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using an irreducible polynomial is performed to obtain the 
(m-l)th degree solution. This is the processing performed 
by the XOR section. The number of ANDs is m 2 and the number 
of XORs is 0(m 3 ), and it is widely known that (m 2 - 1) XORs 
5 can be obtained by selecting an appropriate irreducible 
polynomial and basis. An arbitrary multiplier can always be 
constituted by this method. 

(2) XOR-AND-XOR type 
10 Generally, according to the Boolean algebra rule (A and B) 
xor (A and C) = A and (B xor C) , it is possible for the XOR 
calculation in the residue operation unit of the AND-XOR 
circuit to be moved in front of the AND for use as a 
variable preprocessor (input side XOR calculator) . Thus, 
15 the XOR-AND-XOR multiplication circuit can be obtained. For 
moving the XOR calculation, when an even number of the same 
redundant terms are added to the XOR of the residue 
operation unit (output side XOR calculator) by the 
If! properties A xor A = 0 and B xor 0 = B, many XOR 

*?f 20 calculations may be moved to the front-end as a 
pre-processor . Since this operation can be employed, 
various methods for moving the XOR are available, in 
addition to the simple application of the distributive law. 
Therefore, multiple AND-XOR types are present even with the 
25 same basis or irreducible polynomial. The number of gates 
in the XOR-AND-XOR type can vary, and may become either 
greater or smaller than that in the AND-XOR type. Another 
method is known whereby the number of XOR gates is 
systematically reduced by choosing a special basis, such as 
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the Composite Field Multiplier that will be described below. 

(3) A Method for constructing an XQR-AND-XOR type that can 
be applied only for a limited field (Composite Field 
5 Multiplier) 

The composite field multiplier is a multiplier construction 
method that can be used only in a special case, such as 
where m is a composite number and the basis used for the 
representation of an element in a field may not be an 
10 ordinary basis (such as polynomial basis or normal basis) . 
y- This method will now be described in detail. When m is a 

f3 composite number, extension field GF(2 m ) can be constructed 

by two fold extensions of the field GF(2) • The composite 
I** field multiplier method is a method for constructing, in 

J e jj 

jj* 15 accordance with the extension process, a multiplier having a 

PI 

recursive structure. At this time, when the product of the 

P two values Ax + B and CX + D in GF(2 m ) (A, B, C and D are 

}i[ values in the sub-field GF(2 ra/2 )) 

Cfl (Ax + B) (Cx + D) = ACx 2 + (((A+B) (C+D) + AC + BD)x + BD) 

m 

1*1 20 is employed for one quadratic extension, the number of 

HP 

multiplications performed in the sub-field can be reduced 
from four to three, and the circuit size can be reduced 
(KOA) . At the same time, the circuit can be provided as an 
XOR-AND-XOR structure (the addition performed before the 
25 multiplication corresponds to the XORs arranged in front of 
AND) . It should be noted that the use of KOA is premised on 
the use of the composite field multiplier, otherwise, KOA 
can not be used. If the value of m is the composite number 
for which this method can be applied, a field converter is 

36 
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required, and the circuit size is increased because of the 
overhead. Thus, when a circuit is to be prepared for only a 
single multiplication operation, the circuit structure does 
not generally, employ the composite field multiplier. 

5 

Configuration of a combinational circuit according to the 
invention using ordinary multipliers 

Figs. 7 and 8 are diagrams showing a configuration wherein 
multipliers with common input and a logical sum calculation 
10 circuit for the invention are constituted by a common 
□ AND-XOR type. In the example in Fig. 7, a combinational 

*p circuit using two multipliers is shovin. As is shown in Fig. 

M 7, a first input Al is transmitted to both a multiplier 40 

>; and a multiplier 42, while a second input Bl is transmitted 

Q 15 to the multiplier 40 to obtain a first output 45, and a 
Pi third input B2 is transmitted to the multiplier 42 to obtain 

W a second output 46. In the conventional structure in Fig. 

o 

Ifl 7, even though the multipliers receive common input, no 

O circuit exists that these multipliers can employ in common. 

?|8 
? %f 

■ 20 In the example in Fig. 8, the conventional multiplier 
structure is employed for a combinational circuit that 
performs logical sum calculations. It is again apparent 
that in Fig. 8 no circuit exists that the multipliers can 
use in common when performing logical sum calculations. 

25 

Fig. 9 is a diagram showing a combinational circuit 
employing conventional multipliers. In Fig. 9, one symbol 
is represented by one line, and 8-bit width input and output 
are assumed. The combinational circuit in Fig. 9 includes 
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six-symbol input and one-symbol output, and seven 
multiplication circuits and five addition circuits. In the 
combinational circuit in Fig. 9, inputs SO, . . . and S3Q 
are transmitted to multipliers 46 and are added together by 
5 adders 48. The results are then transmitted to a logical 
sum calculation circuit 50 whereat an output L21Q, the 
logical sum of the inputs, is generated. 

In Fig. 9, a combination of cross-term generation operation 
10 and residue operation, indicated by broken lines, 
corresponds to one multiplication. Since a standard circuit 
structure is used for the multipliers shown in Fig. 9, no 
detailed explanation will be given for the circuit. The 
combinational circuit, including the multiplication circuit, 
15 includes 64 AND gates and about 103 XOR gates, and for the 
entire circuit, the number of gates is 448 ANDs + about 761 
XORs. As is apparent from Fig. 9, for almost all the 
multipliers, one or both inputs are used in common by 
ff| another multiplier. Further, the logical sum calculation is 

20 performed at the final stage. 

Table 2 shows the numbers of gates that are included in 
multiplication circuits for a Galois extension field GF(2 8 ). 
They are conventional multiplier which has two-stage AND-XOR 
25 structure, a composite field multiplier, and a 
multiplication circuit, for which XOR-AND-XOR is employed 
for the multiplication of a sub-extension field GF(2 4 ) . 



% si 

W 

f "s 

U 

CP 
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Table 2 



CO 



* 



Variable 
pre-process 

None 
4XOR*2 



Standard AND-XOR 
multiplication circuit 

Composite Field Multiplier 

XOR-AND--XOR circuit for performing the multiplication of sub-extension field 
GF(2 4 ) to obtain the effects of the invention 



Cross-term 
generation 

6 4 AND 
4 8 AND 



Residue 
operation 

103XOR 
56XOR 



a. Alteration of all three 22XOR*2 
multiplications of a 

sub-field 

b. Alteration of two out 16XOR*2 
of three multiplications 

c. Alteration of one of 10XOR*2 
three multiplications 



30 AND 

3 6 AND 
42AND 



44XOR 

48XOR 
52XOR 



As can be seen from Table 2, as far as only a single 
multiplier is concerned, the circuit sizes for cases a. to 
c. are increased when compared with the size of the 
conventional composite field multiplier, and are not the 
10 minimum size. As is described above, when the three-stage 
XOR-AND-XOR structure is simply employed for a single 
multiplier, in some cases the circuit size may be increased. 



Multiplier structure for a combinational circuit according 

15 to the invention 

Generally, optimization of Boolean algebra is difficult when 
many multiplications and logical sum calculations are 
performed together. However, in consideration of the use of 
the combinational circuit as the processor 12 for the 

20 invention, many logical sum calculators and many multipliers 
are connected in parallel at multiple stages. The i-th 
logical sum calculator generally receives 0-th to (i-l)th 
outputs. Therefore, since the backend part of the operation 
circuit must process in parallel inputs used in common by 
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almost the entire combinational circuit, the optimization 
range is extended. The present inventors focused on this 
point, and achieved an efficient configuration for a 
multiplier by performing the optimization for Boolean 
5 algebra, while taking into account the balance obtained with 
other operations. 

Fig. 10 is a diagram showing a combinational circuit 
according to the embodiment wherein a three-stage structure 
10 is employed for the multiplier in the combinational circuit 
in Fig. 7, which includes conventional multipliers and 
adders. In the combinational circuit in Fig. 10, an input 
Al is transmitted to a first XOR group 52, an input Bl is 
transmitted to a second XOR group 54, and an input B2 is 
15 transmitted to a third XOR group 56. These XOR groups 52, 
54 and 56 are gates used by this invention to perform the 
variable pre-process. Since the XOR groups 52 process the 
common input Al, the circuit size is reduced. The outputs 
Cf'l of the XOR groups 52, 54 and 56 are transmitted to AND 

20 groups 58, and when a cross-term is obtained, the residue 
operation is performed again by downstream XOR groups 60, an 
output 62 being generated by an XOR group 60a, and an output 
64 being generated by an XOR group 60b. In Fig. 10, the 
unit for performing one multiplication is indicated by 
25 broken line BL, and three stages, a variable pre-processor 
(XOR) - a cross-term operation unit (AND) - a residue 
operation (XOR), constitute one multiplier. As is shown in 
Fig. 10, when the XOR-AND-XOR structure is employed for each 
multiplier, and when for the multipliers the XOR 
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calculations to be performed for an input used in common is 
unified, the XOR calculation circuits can be shared by the 
multipliers. So long as the size of a portion (variable 
pre-processors*2 + new cross-term generation + new residue 
5 operation unit) that corresponds to one multiplier is the 
same or slightly larger than the normal multiplication 
circuit, the size of the entire multiplier group can be 
reduced. 

10 Fig. 11 is a diagram showing a combinational circuit 
y k according to another embodiment wherein a three-stage 

O multiplier used for the invention is mounted for the 

conventional combinational circuit in Fig. 10. In the 
J-* combinational circuit in Fig. 11, inputs 66, 68, 70 and 72 

,p 15 are transmitted to XOR groups 74, 76, 77 and 78 that perform 
* fS? the variable pre-process, and the outputs of the individual 

Q XOR groups are transmitted to AND groups 80 and 82 that 

:*J perform the cross-term generation operation. 

If! 

K 20 The outputs of the AND groups 80 and 82 are transmitted to 

f M 

an addition circuit 84 and are added together, and the 
residue operation and the multiplication are again performed 
by a backend XOR group 8 6 that is used in common, so that an 
output 88 is generated. The structure of one multiplier in 
25 Fig. 11 is formed in a block Bx. The difference from the 
structure in Fig. 10 is that, in the invention, even when 
the XOR groups 74, 76, 77 and 78 on the input side are not 
shared the backend or output side XOR group 8 6 that performs 
the residue operation is used in common. 
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Further, in the invention, when the XOR groups 74 to 78 on 
the input side, which perform the variable pre-process, are 
used in common, and the XOR groups that perform the residue 
5 operation are also used in common, the overall structure of 
the combinational circuit can be simplified. 

Fig. 12 is a diagram showing a combinational circuit 

according to an additional embodiment wherein a three-stage 
10 structure is employed for the multiplier in the 

combinational circuit in Fig. 9. The combinational circuit 
U i n Fig. 12 constitutes a part of the processor 12 in the 

^| decoder (e = 2) in Fig. 6 for error correction by RS codes. 

t\. The combinational circuit in Fig. 12, as does the 

„p 15 conventional example in Fig. 9, receives input SO to S3Q, 

and as is shown in the blocks in Fig. 12, an XOR gate 90 

O that performs the variable pre-process for input SO is used 

hi 

in common by three multipliers that correspond to AND gates 

(fl 92, 94 and 96. A residue operation unit 98 is shared by a 

Q 

?2* 20 plurality of multipliers, and both an XOR gate used for the 
variable pre-process and an XOR gate used for the residue 
operation are used in common. One multiplier is indicated 
by a chained line. In the combinational circuit in Fig. 12, 
eight variable pre-processors , seven cross-term generation 
25 operation units, four residue operation units, two 8-bit 
adders, and three adders having the same bit width as the 
cross-term generation operation unit are provided. 

Therefore, when, based on the multiplication circuits in 
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Table 2, a part of the decoder is to be constituted by the 
combinational circuit in Fig. 12, the number of gates 
required can be determined as shown in Table 3. 



5 Table 3 

Composite Field Multiplier 
Present invention 

a. Alteration of multiplication for all 
three sub- fields 

b. Alteration of multiplication for two of 
the three sub-fields 

c. Alteration of multiplication for one of 
the three sub-fields 

h 

□ It is apparent from a. to c. in Table 3, that even if the 

single multiplier is not intentionally minimized, the 
CO' 10 overall size of the circuit can be reduced. 



416XOR + 336AND 

458XOR + 210AND 

444XOR + 252AND 

4 30XOR + 294AND 



Total 752gate 

Total 668gate 
Total 696gate 
Total 724gate 



* Fig. 13 is a graph showing the number of XOR gates and the 

C3 

number of corresponding multipliers that are required for 
Q use for the processor 12 in the error correction circuit 

m 

m 15 having the error correction capability t = 2 to 8 in Fig. 6. 

ft£ In Fig. 13, the vertical axis represents the total number 

of XORs, and the horizontal axis represents the number of 
multipliers. In this case, for the decoder for error 
correction in Fig. 6, m = 8 and the irreducible polynomial 
20 is x 8 + x 4 + x 3 + x 2 + 1. In this case, 662 multiplications, 
531 additions and 30 square calculations are performed. As 
is apparent from Table 3 and Fig. 13, even if the single 
multiplier is not intentionally minimized, by the 
configuration of the invention, a greater reduction can be 
25 obtained in the overall size of the combinational circuit. 
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In the embodiment in Fig. 13, the same variable 
pre-processor, cross-term generation operation unit and 
residue operation unit were used for all the multiplications 
5 in order to clearly present a specific circuit structure. 
In order to obtain better results when these circuits are 
actually implemented, the number of XORs in the arrangements 
for the variable pre-processor and the residue operation 
unit (i.e., which of a. to c. is to be employed) can be 
10 changed and optimized for each multiplication operation 
yi performed by the circuit. Further, in the embodiment in 

P Fig. 13, since the ratio of the number of operations to the 

ir| number of inputs is small and an overhead for field 

;-* a conversion is also present, no very great effect is obtained 

4; 15 by reducing the number of gates. However, in actuality, 
since as is shown in Fig. 14 the ratio of the operations to 
the input is quite high, because of the effects produced by 
a reduction in the number of gates the circuit size can be 



us? 

tfl dramatically reduced 



fe9# 

rU 



20 



25 



Section 3 Error correction algorithm 

A detailed explanation will now be given for an error 
correction algorithm that is used by the decoding circuit 
and the error correction device of the invention. 

Overview of the conventional example 

A. Conventional method for solving the Yule-Walker equation 
or for obtaining an error locator polynomial, and a problem 
associated with these methods 
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According to the invention, it is necessary to find an 
efficient algorithm for a combinational circuit to calculate 
the following simultaneous linear equation, which is defined 
over GF(2 m ) , 



/ 


So 


Si • 


■ S^i 




( A</) 1 


/ 


Si 


\ 




Si 


s 2 • 


• Si 












\ 


Si-i 


Si • 


' S21-2 


J 


, A <J) , 


V 


S21-1 


J 



where S 0 , Si, . . . S 2t -i are the elements of a given GF(2 m ), 
and A* 2) is an unknown amount. 

In this simultaneous linear equation, the matrix on the left 
has a regular structure wherein the same elements are 
arranged obliquely to the right (the direction intersecting 
the diagonal line) , and is called a Hankel matrix. 
Generally, this type of equation is called a Yule-Walker 
equation, and it is known that this equation is widely 
applied for various fields, such as the error correction 
code theory, time-series analysis and signal processing. In 
the error correction algorithm, the Yule-Walker equation 
appears in the portion for determining an error locator 
polynomial. Therefore, according to the present invention, 
the algorithm for obtaining the solution for the Yule-Walker 
equation is applied as an error correction algorithm for 
decoding the Reed-Solomon codes. 

The well known methods for solving the Yule-Walker equation 
are, for example, the algorithm proposed by Levinson and the 
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algorithm proposed by Levinson-Durbin. These algorithms 
start calculating with a matrix of the smallest size 1 and 
recursively determine the solution of an equation wherein 
the size of a matrix is greater. The number of calculations 
5 required by these two algorithms is of order of l 2 . However, 
these algorithms include divisions in the calculation step. 
This means that when the algorithm is mounted as a 
combinational circuit, conditional branches occur depending 
on whether the denominator is 0 or not. Since a separate 
10 circuit must be prepared for each conditional branch, the 

|*4 required circuit size increases as the size of the matrix 

b 

increases. 

Further, as the object of the invention, especially relative 
15 to the decoding of the Reed-Solomon codes, an error locator 
polynomial is determined by obtaining the solution to the 
Yule-Walker equation. The conventional methods for solving 
the Yule-Walker equation can be, for example, the Peterson 



m 

fit 



tfi method, the Berlekamp-Massey method and the Euclid method. 

20 These methods are used to calculate the coefficients of an 



error locator polynomial by calculations of which the number 
is polynomial order with respect to the maximum number of 
correctable errors t. However, when the Berlekamp-Massey 
method and the Euclid method are represented by a 
25 combinational circuit, the following problems occur. 

First, for the Berlekamp-Massey method, it is inevitable 
that multiple conditional branches should be included in the 
algorithm. Therefore, to expand this algorithm for a 
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combinational circuit, for the same reason as described 
above, the circuit size would be increased in accordance 
with the number of combinations. As for the Euclid method, 
while the multiplication and division of the polynomial are 
5 the essential part of the algorithm and, the degree of a 
polynomial that appears in the denominator of the division 
can not be identified in advance, so that there is room for 
generating a conditional branch. Furthermore, due to the 
conditional branches, the circuit size is accordingly 
10 increased, as it is for the Berlekamp-Massey method. 

H 

;=I B. Policy for calculating the Yule-Walker equation and an 



*B error locator polynomial that is appropriate for a 

frt combinational circuit 



15 Since, as is described above, the Levinson (-Durbin) method, 
the Berlekamp-Massey method and the Euclid method include 

*«J conditional branches, a problem has arisen in how to provide 

UF 

P these methods as combinational circuits. In order to 

implement the Yule-Walker equation by a combinational 

in 20 circuit, an algorithm that has no conditional branching must 
be found, and this is an essential object for the algorithm 
of the invention. 



In this case, the Peterson method known for decoding of the 
25 Reed-Solomon codes can be used as the algorithm of the 
invention. Using the Peterson method, the Yule-Walker 
equation can be solved directly, and the solution of the 
Yule-Walker equation can be represented as determinants by 
the Cramer formula: 



47 



JP920010049US1 



A 0 



50 Si ■•• Sx-i 

51 S 2 "' Si 



r x(i) 
Ai = 



S2-1 Sj S21-2 
Sq Si S1-1 



S 21-1-2 
S2I-1 

S21-1 



,1 = 1, 



5 2-1-1 Si-i 

Si Si+i 

51 S2 Si 

52 S3 Si+i 



Si Sj+i "* S21-1 

Therefore, determinants A ( o i} need be obtained for each 1 = 1, 

^(e) 

. , t, and the determinants A if i = l r ...,e need be 
calculated for the number of errors e. 



However, when the expansion of the determinants is provided 
by using a circuit, the required number of multipliers is 
dramatically increased as t is increased, so that it is 
difficult for the determinants to be directly expanded. 
Therefore, in this invention, the number of calculations is 
reduced by the recursive structure of the Hankel matrix. 
The calculation of Af t(i) using the Katayama-Morioka method 
will be explained. 
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i y 



When the calculation algorithm for A ( / } in the 
Katayama-Morioka method is written for 1 = 1 to 1 = 4, the 
form in Fig. 14 is obtained. 

5 For comparison, a method devised by Koga will now be 
described as another method for recursively calculating the 
Hankel matrix. According to the method by Koga, new error 
locator polynomial i ; i + 2uD(X, Y) is defined, 
wherein 



was 

0 ;;i + 2 U D(X, y) = 

a io 



r Si+YX 1 S 1+1 +YX i+1 ••• S i+u + YX i+v 
S i+ i + YX i+1 S i+2 + YX 1+2 ••• S i+u+ i + YX i+v+l 



"f\ To calculate this new error locator polynomial, the Hankel 

matrix i ;i+ 2uQf which has the i-th syndrome Si as the (1, 1) 
element, is employed, and a determinant obtained by 
15 symmetrically removing multiple rows and columns from this 

in 

□ Hankel matrix is defined as a Q determinant. When the 

subscript numbers of syndromes that appear as a diagonal 
element are designated in order beginning at the upper left, 
only one Q determinant can be determined. In this case, the 
20 Q determinant can be represented by the row of subscripts 
[a x , a 2 , . . . r a p ] . According to the Koga method, an 
algorithm is presented by to calculate error locator 
polynomial i ;i+ 2uD(X, Y) the Q determinant. 

25 All of the above described conventional methods have the 
following problems. First, for the algorithm used in Fig. 
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14 for calculating 




new terms sequentially appear on the 



right due to the asymmetry of determinants to be calculated, 
and as a result, as the size of a matrix is increased, the 
number of multipliers that is required is increased 
5 accordingly. Thus, an algorithm for which the combinatorial 
increase of the number of multipliers is as small as 
possible is preferable. 

As for the Koga algorithm, the Q determinant defined by Koga 
10 is symmetrical, and a reduction in the number of multipliers 
H is carried out. However, there is a limitation on the use 



of the Koga algorithm; this algorithm can be applied for BCH 



^ 15 algorithm that this limitation can be eased, such an 



application example is limited to the binary, narrow sense 
BCH code. 



According to the invention, since the combinational circuit 



20 is applied for an optical communication system, it is 

required as an object that the decoding of the (255, 239) 

Reed-Solomon code (minimum distance = 17) be efficiently 

carried out by the combinational circuit. Therefore, an 

algorithm is needed that can perform an efficient 

25 calculation by a specific method, regardless of whether the 
minimum distance is an odd or even number, 

C. Definitions of terms used for this invention 

Before a detailed explanation is given for the algorithm of 




codes or Reed-Solomon codes only when the minimum distance 
is an even number. Although it is disclosed in the Koga 
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this invention, definitions for the terms used for this 
invention will be given. 

(1) Syndrome 

5 Generally, when a primitive element of the Galois extension 
field GF(2 m ) is defined as a, and h < 2 m -l is a positive 
integer, a 2 m -element cyclic code that has a code length of n 
= 2 m - 1 and that employs 
G(x) = (x-1) (x-a) (x~a 2 ) . . . (x-a^ 1 ) 

10 as a generator polynomial is defined as the Reed-Solomon 
codes. That is, when k = n - h and when the k-th degree 
polynomial having k information symbols as its coefficients 
is defined as M(x), M(x) and x n " k are multiplied, and the 
result is divided by G(x), as follows, to obtain the residue 

15 R(x) . 

M(x)x n " k = Q(x)G(x) + R(x) 

Then, the polynomial (transmission polynomial) that has as 
coefficients a coded sequence having the length n is defined 
20 as 

W(x) = M(x)x n " k - R(x) = Q(x)G(x). 

At this time, the coded transmission sequence is represented 
as systematic code, of which k information symbols are 
located on the left and h = n-k check symbols follow these 
25 symbols. The minimum distance d m i n = h + 1 of the 
Reed-Solomon code, and the maximum number of correctable 
errors t = [h/2] are provided. 

The following decoding algorithm is given for making an 
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estimation of the original transmission sequence based on a 
received sequence. 

(2) Calculation of syndromes and detection of an error 
Assume that errors have occurred, and that the locations of 

5 the errors are denoted by i 0 , . . . , ii- x and the error values 
are denoted by E u , . . . , Ei^ . A polynomial having £ io , ...^i w 
as coefficients is defined as 
E(x) = E u x i «+. . .+E i2 ^ 1 K i ^ , 

a polynomial having a received sequence of b 0 , . . . , b n -i as 
10 coefficients is provided as 

where Y(x) is defined as a reception polynomial. 

Then, the reception polynomial Y(x) is employed to calculate 

the following syndromes: 

15 Si = Y(a L ) s GF(2 m ), i = 0, 1, . • . , 2t - 1 . 

In this case, since W(a x ) = 0, i = 0, 1, . . . , 2t-l is 
established, and the obtained syndromes satisfy Si = E(a x ), i 
= 0, 1, . • . f 2t-l. Therefore, when there is no error, all 
the syndromes are 0, so that the value of syndromes can be 

20 used to determine the presence of errors. 

(3) Determination of the number of errors and the location 
of errors 

Assume that the number of errors that occur is 1 and that 
25 the location of the error is i 0 , . . ii-i, i.e., assume 

that the values of b iof ...,b il _ l are incorrect. In order to 
determine the number of errors 1 and the error locations 
represented by i 0f . . ii~i, the following polynomial, 

having a~ io , . . . , a'* 1 - 1 as roots, is defined: 
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A< J >(^)=ff(l-a i ^) = l+A^+ . . . +A ( J i V- 1 +A ( 1 i) x i 

where a~ io , . . . , a' 11 - 1 are called error locators and A (1) (x) is 
called an error locator polynomial. 



5 Further, A^, . . . , A ( / } are coefficients of the error locator 
polynomial and are provided using the elementary symmetric 
function of a u f . . . , a £l ~ l . 



O 

Si. 

'. : e 

z ,. 



ffi 



The unknown quantities A^ , . . . , Aj 2) 
10 simultaneous linear equations: 



satisfy the following 



r 



So 
Si 



Si 

s 2 



Sx-i 
Si 



\ Sj-i Si S21-2 y 



Si 



V 5 2I-1 J 



This is nothing but the Yule-Walker equation explained in 
section A. While at this step "1" is an unknown number, it 
is known that when the number of errors that actually 

15 occurred is 1 £ e £ t, the Hankel matrix on the left is 
regular when 1 = e and is irregular when t ^ 1 > e. 
Therefore, for 1 = 1, . . -ft, only the determinants for 
the Hankel matrix on the left need be calculated, and a 
maximum integer that is not 0 can be defined as the number 

20 of errors e. When the above equation is solved with 1 = e, 
the error locator polynomial can be obtained. 



In this invention, to specify error locations, the error 
locators, i.e., the roots of error locator polynomial A (e) (x) 
25 = 0, need be calculated. For this calculation, a method can 
be employed whereby a" 1 , i = 0, 1, . . . , n - 1, is 



53 



JP920010049US1 



substituted in to determine whether the result is the zero 
point of the error locator polynomial. This method is 
called the Chien search method. When the zero point of the 
error locator polynomial is denoted as a~ u , . . . , a~ ie ~ l , i 0 , . . 
5 . , i e _i provide the actual error locations. 



y 

fft 



war 



ill 



in 



10 



(4) Calculation of error values 

Error values can be obtained by solving the following 
Vandermonde simultaneous linear equations: 



f 



,11 



1 

a ie-l 



a io(e-l) ... a ie-i(e-l) j 



r 



\ E ^ J 



So 



\ 6 ^ J 



In this case, polynomial S(x), having a syndrome as a 
coefficient, is defined as 
S(x) = So + Six + S 2 t~iX 2t ~ l , and 
Q(x) = A (e) (x)S(x) mod x 2 ^ 1 
15 is defined, wherein Q(x) is called an error evaluator 
polynomial. In this case, the solution for the Vandermonde 
simultaneous linear equation can be obtained by calculating 

,e-l 

This is called the Forney algorithm. When the error 
20 locations and the error values are obtained, only these need 
be subtracted from an input digital signal, so that a 
digital signal for which errors were corrected can be 
output . 



25 D. Algorithm to solve Yule-Walker equation of this 
invention 
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The object of the present inventors is to find an efficient 
algorithm that employs the combinational circuit to obtain 
the solution for the following Yule-Walker equation, defined 
over GF(2 m ) : 



So 



Si-! ^ 



^ Si-i S21-2 Ai i} J \ S21-1 J 
where S 0/ Si, . . . , S 2t -i are the elements of a given GF(2 m ), 
and A ( / } are unknown amounts. 



f 



9 
frt 



In this invention, by the Cramer formula the solution for 
10 the Yule-Walker equation is represented as the determinant 
form shown in Fig. 15, and a recursive structure is employed 
to obtain an efficient method for calculating determinants. 



In order to calculate determinants in Fig. 15, the focus in 
15 the invention is on the following Jacobi's formula. 
<Jacobi ' s f ormula> 

A = (aij) is defined as an order n square matrix on a 
commutative ring having a unit element of 1, and the 
cof actor (i, j) of A is defined as Aij. When A$ is defined 
20 as the cofactor of minor A$ , with a set of subscripts, 

Vi = {ii/ . . . / irf (ii < • ■ • < i r )}f v = . . ., j r , (ji 

< . • . < jr) }, 

the following equation is established. 



25 



Ai Ul A llJx 



Ai r ji " Ai rJr 
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ftfk 

m 

ft 

C3 
S3 



In this invention, the following equation 

Aixj, - A i2j2 -A i2jl • A iU2 =(det A)4T 2) 

can be employed, which holds when 

p = {i lf i 2 } , v = { ji, j 2 } . 

The calculation of Af t(J) will now be described using 
Jacobi j s formula. 

First, A^ at(I+1) is represented in the following form. 



10 A ( 0 I+1) : 



5 0 Si 

51 "•• 

Si-i - 



•52 2-2 ^22-1 
522-1 ^22 



By careful examination of this determinant, A* at(i) is obtained 
by removing the (1 + 1 - i)th row and the 1-th column from 
j^hatii+i) ^ and jtetti) is obtained by removing the 1-th row and 

the 1-th column. That is, since Af t(i) andAf t(1) are (1 + 1, 
15 1+1) and (1+1, 1+1-i) cofactors of A ( 0 i+1) respectively, 
T (i) a _ a _ ?W 

A 0 , Ai+i, - Ai+i-i., 1+1 - , 



Ai+i, 1+1 

while ii = ji = 1 + 1 - i, 12 



j 2 = 1 + 1 is set in Jacobi ! s 



formula , 



Further, the (1+1-i, 1+1-i) cofactor of A 



hat(l+l) 



is 



defined as r ( / +1) , the structure of which is shown in Fig, 16. 
20 And by Jacobi 1 s formula, the following equation is obtained: 

ir>X? + (A?) 2 = A ( o +1) rf\, i = 1 1. 



When Jacobi ! s formula is employed, the calculation of A^ t(J) 
results in the calculation of f which is the 

25 determinants of symmetric matrices. It should be noted, 
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however, that, to obtain Af t(i) , not only the calculation of 
r ( / +1) but also the 2x1 multiplications and the calculation 
of 1 square roots are required. Since the calculations for 
the square root and the square calculations can be performed 
5 as linear calculations for GF(2 m ), these calculations can be 
implemented as a circuit substantially at the similar cost 
as that of an addition. Therefore, only a very small cost 
is required, compared with a multiplier that is a non-liner 
operating circuit. Therefore, the present inventors focused 
10 on only the multipliers, and discussed the number of them 
fr* that would be required. Since the characteristic of the 

q GF(2 m ) is always 2 and all F ( p are symmetrical, the algorithm 

Stt proposed here always cancels terms that are generated from 

f ft arrangements that are asymmetrical to the diagonal line in 

15 the process for expanding the cof actor of determinants. For 
« example, when the cofactor expansion is calculated for a 3 x 

3 symmetric matrix, the following equation is obtained 



PI 



a b c 
b d e 
c e f 



= adf+ae 2 +b 2 f+bec+ c 2 d+bec-adf^ae 2 + b 2 f+ c 2 d 



Since the term "bec M , which is generated from the 
20 arrangement asymmetrical to the diagonal line, always 
appears twice, this term is canceled. Thus, when the 
algorithm of the invention is used for a combinational 
circuit including multipliers, the required number of 
multipliers can be reduced. 

25 

The general form of the algorithm for the recursive 
calculation of , 1 = 1, 2, . . . , t+1, i = 0, 1, . . ., 
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t, is provided as follows, 

0. rf } = 1, r ( 0 2) = So, if = s 2 

1, when 1 > 2, i = 1 , 

5 2. when 1>2, i = 1, . . . . 1-1, 

first, one auxiliary amount for describing the algorithm is 
defined. 



When {ii, 
10 det[{i i/ . 



. , i n } is defined as a set of subscripts, 
, i n }] is defined as 



det[{ii, . . - , i n }] = 



Si 2 Sziz-i! 



- Si. 



Specifically, det[{ii, . . . , i n }] is the determinant of a 
symmetric matrix wherein the first row is s llf ... f S ln and the 
(p, q) element is S^i^ . This determinant is obtained by 
15 symmetrically removing several rows and columns from the 
Hankel determinant A ( 0 J> . And r ( / } is calculated by using 



det [{i lr 



, i n }] as follows. 



1-1 



n = S2i-2li-i + 2 . S^_2_ fc det[{0 , 1, . . . , l-2}-{l-l-i, 1-1-*}] . 

In this equation, det[{0, 1, . . ., 1-2} - 1-1-k}] 
20 is the determinant for a symmetric matrix that is obtained 
by symmetrically removing, from r ( / _1) , 1-1-i and 1-1-k rows 
and 1-1-i and 1-1-k columns. Note that, when k = 1 and i = 
1, the determinant matches expression of r£~ 2 2) , r£i" 2 2) . 



25 3. Generally, det [ { }1 is calculated as follows. 
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det[{ii , . . . , in}] = 

S 2 i„-ixdet[{ii, ... r i n -i}]+%Sl_ il+ik det[{i lr . . . , i„-i}-{i*}] • 

g Application of the algorithm of the invention for the 

decoding of the Reed- Solomon code 

5 An explanation will now be given for the embodiment wherein 
the algorithm of the invention for solving the Yule-Walker 
equation described in D is applied for the Reed-Solomon 
codes. Generally, it is assumed that the order of the 
Yule-Walker equation (the number of unknown quantities) is 

10 known. However, for the decoding of the Reed-Solomon codes, 
since the order is also unknown, this must also be 
determined. 

M) Calculation of T { P 
15 When a sequence of syndromes, S 0 , Si, . . •, S 2t -i, is 
provided, 

Tf , 1 = 1, 2, . . ., t+1, i = 0, . . ., t 

is calculated in accordance with the algorithm explained in 
D. During this calculation, 

20 a? = rr } , 1 = 1 t 

is also calculated. It should be noted that for fast 
decoding of the Reed-Solomon code, the present inventors 
have taken into consideration the fact that the algorithm is 
implemented as a combinational circuit. However, use of the 

25 error correction algorithm of the invention is not limited 
to the combinational circuit; it can be employed as an error 
correction device by a sequential circuit. 
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(2) Determination of the number of errors 

Assume that the number of errors that actually occurred is 
represented as e. Based on the value of 
a (j) - r u+1) 1=1 t 

5 e can be obtained as the maximum IT 1" that satisfies Ao at(i) ^ 
0. 

(3) Determination of an error locator polynomial 

When e < t is established as the result of the determination 
10 of the number of errors, since Ao at(efl) =0, in accordance 
with the algorithm of the invention, the above equation can 
be simplified as follows: 

Af = 1 /?pXf / i = l e . 

Since the error locator is the zero point of the error 
15 locator polynomial, the error locators are values unchanged 
by multiplication of the coefficients of the error locator 
polynomial by a constant. Therefore, the following quantity 



can be used instead of Af* t(e) . In other words, the 
20 multiplication appearing in the above equation is not 
required- When e = t is established, the error locator 
polynomial is calculated in accordance with the following 
equation: 

Ai ; A 0 +A 0 = . . e. 

25 At this time, according to the algorithm proposed by the 
present inventors, in appearance, syndrome S 2t , which can not 
be calculated, seems to be necessary when the minimum 
distance is an odd number (= 2t + 1) . However, since the 

60 



JP920010049US1 



equation of this invention serves as the identity of the 
syndromes, it can also serve as the identify of the syndrome 
S 2 f Furthermore, since Af m does not include syndrome S 2t , 
when syndrome S 2 t appears during the cofactor expansion for 
5 A2 at(t+I) and r ( / +1) , it should always be canceled. Specifically, 
since a term that includes S 2t and that appears during the 
cofactor expansion of rf +1) is r£\s 2t , when r ( / +1) Af t(t) is 
expanded, a term including S 2t is Ao at(t) r£\s 2t - Further, since, 
of the terms that appear during the cofactor expansion of 
10 A£ at(t+1) , Ao atCt) S 2t includes S 2t , a term including S 2t when A ( 0 t+1) r^i 
is expanded is Af t(t ¥£\s 2t • Therefore, all these terms must 

r! 

m always be canceled. 



W 

m 

S P 15 during the cofactor expansion of A ( 0 t+1) and r* t+1) , terms having 

U. 

s S 2 t as a coefficient need not be calculated. In this manner, 

b 

Q Reed-Solomon codes having an arbitrary minimum distance. 

j at 

w 

5pa? 



It is therefore understood that, of the terms that appear 



the algorithm of the invention can be applied for 



Further, since the multiplication of the term including S 2t 
20 is not necessary, from the viewpoint of the reduction in the 
number of multipliers, the algorithm of the invention is 
superior to the Koga algorithm. In addition, as is 
described above, the calculation of a square root can be 
implemented as a circuit at the same cost as that for 
25 addition, and only a small cost is required compared with 
the cost of a multiplier. 

F. Example of application of the error correction algorithm 
of the invention for the decoding of Reed-Solomon codes 
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An explanation will now be given for a case wherein the 
error correction algorithm explained in E. is employed for 
the decoding of Reed-Solomon code for t = 4. When t = 4, 
according to the invention the following equations are 
5 determined. It should be noted that for simplification the 
determinant is represented as det[{i if . . . , i n }] = detii . 



(1) Calculation of IT , i = 0, . . 1-1, .... 5 

The calculation results obtained by the invention are shown 
10 in Fig. 17. 

(2) Determination of the number of errors 

Since Xj 0 = r ( 0 i+1) , 1 = 1, . . 4 is determined by rf , 

obtained in (1), and the number of errors e can be 
15 determined as the maximum 1, 1=1, 2, 3, 4, that satisfies 

A 0 * 0. 

(3) Determination of an error locator polynomial 

When, for example, e = 2 is ascertained using the 
20 calculation in (2) , the error locator a u , a 11 can be 
obtained by solving the following algebraic equation: 



When e = 4 is ascertained, as is described above, the error 
locator can be obtained by 



It should be noted that, as is described above, the term 
including the syndrome S 8 need not be calculated when the 

r ( / } ,r ( 0 6) = A 0 5) calculations are performed. 



i 



n • 
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Fig 18 is a schematic flowchart for the error correction 
algorithm of the invention. In the error correction 
algorithm of the invention, first, at step 200 syndromes S 0 , 
5 , S*-i are input, and at step 201 an error locator 

polynomial r is calculated. When Tf if* are obtained, 

at step 202 the number of errors is determined to be the 

£ . „ <hat(m) _ tp&h-i) ± q Then, at 
maximum integer m that satisfies A 0 - T 0 * 

step 203, a check is performed to determine whether the 
10 number of errors e is equal to the maximum number of errors, 
and when e = t (yes), at step 204 an error value is 

calculated using iT-AT" iTM^"*?** - Whereas when 

e * t (no), at step 205 an error value is calculated by only 
r( efi)_ A hat(e) T^. and at step 206, A 0 # • • • / Ae 



15 obtained 



G Calcination circuit w hpn the algorithm of the invention 
^ aB Elied for the cal culation of an ^rror locator 
pol ynomial 

20 Fig 19 is a block diagram showing a circuit for calculating 
an error locator polynomial based on the algorithm proposed 
by the invention. Fig. 20 is a diagram showing a circuit 
for calculating an error locator polynomial by the algorithm 
of the invention. This circuit comprises a {rf } 

25 calculation block 100, a circuit block 102 for calculating 
the number of errors and a circuit block 104 for determining 
an error locator polynomial. 

The functions of the blocks in Fig. 19 will now be 
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described, A series of syndromes that a sequential circuit 
has obtained using the input digital signal are transmitted 
to the circuit block 100. In the circuit block 100, these 
syndromes yield Tf } , m = 1, 2, . . t+1, i = 0, . . . , t, 
in accordance with the algorithm of the invention. This 
corresponds to (1) for the detailed explanation of the 
algorithm. 

Following this, the circuit block 102 employs the obtained 
value , m = 1, 2, . . . , t + 1 to calculate the number of 
errors e, and outputs F** 1 * , i = 0, . . . , e, which 
corresponds to the value of e. When e = t, in addition to 

the above, r ( 0 t+2) =A ( 0 t+1) is also output. This corresponds to 
detailed explanation (2) for the algorithm. The circuit 
block 104 then employs T^ 1 ^ , i = 0, . . e to calculate 

the coefficients of the error locator polynomial. This 
calculation is performed in accordance with the process 
corresponding to detailed explanation (3) for the algorithm. 

The algorithm of the invention has been used for the 
combinational circuit in order to perform fast decoding of 
the Reed-Solomon codes. However, the algorithm of the 
invention can also be used for a sequential circuit in order 
to reduce the circuit size. 

H. Circuit size when the algorithm of the invention is used 
for the decoding of the Reed-Solomon codes 

An explanation will now be given for the size of a circuit 
when the algorithm of the present invention is used for the 
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decoding of the Reed-Solomon codes. As is described above, 
the calculation of square roots and the calculation of 
squares can be performed by a circuit having substantially 
the same cost as an addition circuit, and compared with a 
5 multiplier , the cost required is very small. The present 
inventors have focused only on the multipliers, and 
discussed the number of multipliers that are required. 

Table 4 shows the number of multipliers required by the 
10 algorithm of the invention in a range extending from t = 1 
to t = 8. In Table 4, for comparison, the number of 
multipliers required for each of the conventional examples 1 
and 2 is also shown. 

15 Table 4 

Comparison of the number of multipliers 

Maximum 12345 6 7 8 

number of 
correctable 
errors t 

Algorithm for 0 3 17 48 117* 255* 548* 1111* 

conventional 

example 1 

Algorithm for 2 9 22 49 98 189 351 640 

conventional 

example 2 

Algorithm of 2 7 21 46 94 179 331 597 
the invention 

Note: a value having an appended * is an estimated one 

20 As is apparent from Table 4, while taking the required 
number of multipliers into account, the algorithm proposed 
in this invention is superior in all number of errors t to 
the algorithm (conventional example 2) proposed by Koga. 
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Further, the use of an algorithm for the decoding of (255, 
239) Reed-Solomon code (t = 8) is especially important for 
the optical communication field; however, the Koga algorithm 
can not be so employed because the minimum distance of 
Reed-Solomon code is an odd number (= 17) . Since the 
algorithm of the invention can be used for Reed-Solomon 
codes having an arbitrary minimum distance, it can also be 
used for (255, 239) Reed-Solomon code. This is shown in 
Table 5. 

Table 5 

Comparison of the application ranges of the Koga algorithm 
and the algorithm of the invention 



Minimum 
distance of 
code 




15 


16 


17 


18 




Koga 

algorithm 




X 


0 


X 


o 




Proposed 
algorithm 




O 


o 


o 


o 





(x indicates the algorithm can not be used, and O indicates 
the algorithm can be used. While the Koga algorithm can be 
used only for code having an even minimum distance, the 
proposed algorithm can be used for code having an arbitrary 
minimum distance. (255, 239) Reed-Solomon code standardized 
by the ITU has a minimum distance of 17.) 

The calculation algorithm in conventional example 1 
(Katayama-Morioka) can also be used for Reed-Solomon code 
having an arbitrary minimum distance. However, from the 
viewpoint of the required number of multipliers into 
account, when t is equal to or greater than 4, the algorithm 
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proposed in this invention requires a smaller number of 
multipliers than does the algorithm of conventional example 
1. It has especially been found that when t = 8, the 
algorithm of the invention can reduce the number of 
multipliers by about 50%. And as for a circuit size, when t 
= 8, 10K gates are currently required for the calculation of 
the error values. For conventional example 1 about 80K 
gates seem to be required, while the employment of the 
algorithm of the invention can reduce the gates for the 
calculation of an error polynomial to about 40K gates. 

Fig. 20 is a schematic diagram showing an error correction 
device according to the invention. The error correction 
device in Fig. 20 comprises: an encoding block 110, for 
receiving and encoding a digital signal; an input block 112, 
for receiving the encoded digital signal ID and for 
calculating syndromes; a process block 114, including a 
decoding circuit; and an output block 116, for correcting an 
error using an error location and an error value that are 
output and for outputting the resultant digital signal OD. 
The encoding block 110 receives the digital signal, which is 
transmitted by interleaved wavelength division multiplexing, 
converts the signal into Reed-Solomon codes, for example, 
and transmits the encoded digital signal to the input block 
112. The input block 112 employs a sequential circuit to 
calculate syndromes for the received digital signal, and 
transmits the syndromes to the process block 114. 

The process block 114 includes a decoding function employing 
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the algorithm of the invention, and calculates error 
locations and error values. The error locations and the 
error values that are obtained are transmitted to the output 
block 116, the error is corrected, and the resultant digital 
5 signal is output. The above described error correction 
circuit can be provided as an error correction device 
comprising multiple hardware components, or a semiconductor 
technique may be employed to provide a semiconductor device, 
such as an ASIC, for which the individual functional blocks 
10 of the error correction circuit are implemented on a silicon 
Ui wafer. In addition, the algorithm of the invention can be 

£3 mounted as firmware for the error correction device, or may 
k be provided as a computer-readable program that is recorded 

on a storage medium, such as a floppy disk, a hard disk, an 
15 optical disk or a magneto-optical disk. The program of the 
$3 invention may be written in an arbitrary object-oriented 

p language or a programming language such as C, and stored on 

the above mentioned storage medium. 



m 

fit 



20 As is described above, according to the present invention, 
it is possible to provide a decoding circuit and error 
correction method that can extremely efficiently correct 
errors in the fast optical communication field. 



25 Description of the Symbols 
10: Input unit 
12: Processor 
14: Output unit 
16: Syndrome calculator 
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18: Error locator polynomial calculator 
18a: Register 

20: Error value polynomial calculator 
22: Register 
5 24: AND gate 
26: XOR gate 
28a, 28b: imn buffer 
40, 42: Multiplier 
45a, 45b, 62, 64: Output 
10 46: Multipliers 
^ 47: Adder group 

O 52, 54, 56, 60a, 60b: XOR group 

P 

P| 60: Downstream XOR group 

66, 68, 70, 72, 77, 78: Input 
15 80, 82: AND group 
84: Addition circuit 
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